P R O N U N C I at I O N M O D E L I N G F

نویسندگان

  • Alan W Black
  • Lori Levin
  • Florian Metze
  • Richard Sproat
  • Sunayana Sitaram
چکیده

Natural and intelligible Text to Speech (TTS) systems exist for a number of languages in the world today. However, there are many languages of the world, for which building TTS systems is still prohibitive, due to the lack of linguistic resources and data. Some of these languages are spoken by a large population of the world. Others are primarily spoken languages, or languages with large non-literate populations, which could benefit from speech-based systems. One of the bottlenecks in creating TTS systems in new languages is designing a frontend, which includes creating a phone set, lexicon and letter to sound rules, which contribute to the pronunciation of the system. In this thesis, we use acoustics and cross-lingual models and techniques using higher resource languages to improve the pronunciation of TTS systems in low resource languages. First, we present a grapheme-based framework that can be used to build TTS systems for most languages of the world that have a written form. Such systems either treat graphemes as phonemes or assign a single pronunciation to each grapheme, which may not be completely accurate for languages with ambiguities in their written forms. We improve the pronunciation of grapheme-based voices implicitly by using better modeling techniques. We automatically discover letter-to-sound rules such as schwa deletion using related higher resource languages. We also disambiguate homographs in lexicons in dialects of Arabic to improve the pronunciation of TTS systems. We show that phoneme-like features derived using Articulatory Features may be useful for improving grapheme-based voices. We present a preliminary framework addressing the problem of synthesizing Code Mixed text found often in Social Media. Lastly, we use acoustics and cross-lingual techniques to automatically derive written forms for building TTS systems for languages without a standardized orthography.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Amitraz Poisoning; A case study

A m i t r a z, a n i ns e c t i c i d e /a ca ri c i de of the f o r m a m i d i n e p e st i c i d e s group, is a ? 2 a d r e n e r g i c ag on i st a nd of t he a m i d i ne c h e m i ca l f a m il y generally us e d to c o n t r ol animal e c top a r a s i t e s. Poisoning due to am i t r a z i s r a r e and character...

متن کامل

شناسایی دگرگونی ژنتیکی در ویروس تب برفکی تیپ A با استفاده از تعیین ردیف نوکلئوتیدی قسمتی از ژن VP1 ویروس

,1s1ya:.J'$n :7tb !9y.tu;1 9; q 5t'1& p I r lt 4!lr 9 61,; .l.iy c.!! ob lr+ : tt 4jr.'j .Al JLrob.;qrolr,J'fll t q ..rlaLr. 4 RT-PCR ri,Sle 1r o.ri 6bi!l Jt-t5 RIA :.pe, .pe1 *q&,.5b.'.;9; VPr Oijl ,r,iiqo-f .ddisl-t,r-a.ttJ nlceloltl Jlhr""gbyp,glrrsc*l . d,ri$ multiplexRT-PCR cycle sequencing cie) jlosu:pl t r PCR Jr-e .ri . 'r-JJl9; a{.Jt Fluorescent dye deoxy-terminator ar.bgy.5bai...

متن کامل

شناسایی کلستریدیوم سپتیکوم به روش واکنش زنجیره ای پلیمراز (PCR)

li . a3"9rl aiL,;.61'l,.lo95g9s,r";lo.r;'l,q,Jtaer{J*-;*J5O+.t1r (PCR) al:9, .,rat5':.1"ri+.ltL" : g rb .a991la.iL;.5$.ir 95 2g;,r, A 9; *a llo'r,il.spyl.;-*l5ar-9ulA :ba!9al u9fe,rrl.lhrl ol..id4jlpbd I . 6.r^ir95o;U g9s'tn 31 6 tl ty. * y; : t;' 9 t l, PCR pt+l .bte-;IDNA 6l.g*l ,LJ:.1;"-J5d;t h+i5lt.#t^i .A.J'-6Jij j4'f.i 1l o'& u>lp uet a3l.5b.rol;;l osl'i;rl {f fupt{tl*J5 t;*Sl 9a:-...

متن کامل

تاثیر مرکزی هیستامین بر درد فرمالینی در خرگوش: نقش سیستم اپیوئیدی

.yiU s1 r qdJ 9 6!r.1 l,-r,iU 61 r,r1.11 ;oal il go,-7^19 ai f * *,-) S '*{'?'J*'fL' t:::tc: l/'2f /2 g.6;gb.t e*)* +r-,JifF.'L!.p"i9 r:.",ii:oul9+ 'Fft s ;J"r.pbjr;^*f- $ J9b +9 ll o;lr"iJ$!l odjUll Jt'56rlr2lr.! :rti9, ,(Jt;5) O*Jt"Jt i,;;ia.rJ1.pll oQr.i; nlCel ."r95f a+1,,r-..;ta 4..59L*yrtirlrcrbtUg i*r..p.259Fl'tfA,lflA.;lLirl,r6ek- 5r crrt!"rlr (efs.s..o' ) o*:..i1 Ji (.fsfefa),-;...

متن کامل

Computing Effective Pair Potentials from Pair Distribution Data

c l a s s M C A b s t r a c t O b j e c t { p u b l i c : / / D o a l o c a l m o n t e c a r l o s t e p . / / R e t u r n t r u e i f t h e s t e p w a s a c c e p t e d , f a l s e o t h e r w i s e . v i r t u a l b o o l s t e p ( ) = 0 ; } ; / / A b s t r a c t b a s e c l a s s f o r M C r u n s . c l a s s M C A b s t r a c t R u n { t y p e d e f v e c t o r < M C A b s t r a c t O b j...

متن کامل

A benchmark for testing adaptive systems on structured data

I n t h i s p a p e r , w e w i l l d e s c r i b e a g e n e r a l m e t h o d o l o g y w h i c h c a n b e u t i l i z e d i n t h e g e n e r a t i o n o f i m a g e r e c o g n i t i o n b e n c h m a r k p r o b l e m s , w h i c h c a n b e u s e d t o v a l i d a t e a n d v e r i f y l e a r n i n g a l g o r i t h m s f o r d a t a s t r u c t u r e s . T h i s m e t h o d o l o g y i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015